Generating Ground Truthed Dataset: Automatic or Semi-automatic?

نویسندگان

  • Weihua Huang
  • Chew Lim Tan
  • Jiuzhou Zhao
چکیده

Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating chart image dataset and multilevel ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dataset as well as the ground truth data are publicly available so that other researchers can access them for evaluating and comparing performances of different systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mapping Transcripts to Handwritten Text

In the analysis and recognition of handwriting, a useful first task is to assign ground truth for words in the writing. Such an assignment is useful for various subsequent machine learning tasks for performing automatic recognition, writer verification, etc. Since automatic word segmentation and recognition can be error prone, an intermediate approach is to use a text file that is a transcripti...

متن کامل

A Framework for Evaluating Underwater Mine Detection and Classification Algorithms Using Augmented Reality

This paper presents a novel framework for evaluating Target Detection and Classification algorithms and concepts of operations based on Augmented Reality (AR). Real sonar images and synthetic target models are used to generate a ground-truthed AR theatre of operation. The detection/classification results of the human operator or Automatic Target Recognition (ATR) algorithm to be evaluated are t...

متن کامل

Automatic Prostate Cancer Segmentation Using Kinetic Analysis in Dynamic Contrast-Enhanced MRI

Background: Dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) provides functional information on the microcirculation in tissues by analyzing the enhancement kinetics which can be used as biomarkers for prostate lesions detection and characterization.Objective: The purpose of this study is to investigate spatiotemporal patterns of tumors by extracting semi-quantitative as well as w...

متن کامل

The BEHAVE video dataset: ground truthed video for multi-person behavior classification

Although there is much research on behaviour recognition in time-varying video, there are few ground truthed datasets for assessing multi-person behavioral interactions. This short paper presents the BEHAVE project’s dataset, which has around 90,000 frames of humans identified by bounding boxes, with interacting groups classified into one of 5 different behaviors. An example of its use is also ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007